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The puzzle of combination frequencies found in heat-driven pulsators 
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Searching for combinations in the frequency spectra of variable stars is a commonly used method within the asteroseis- 
mological community, as harmonics and linear combinations of individual frequencies are expected to appear not only by 
chance, but also as a characteristic signature linked to different physical phenomena, e.g., nonlinear oscillations, binarity, 
and rotation. Furthermore it is very important to identify independent frequencies for modelling purposes. We show that 
using high precision data sets delivered by recent space missions, the distinction between combinations having a real phys- 
ical meaning and spurious combinations which only appear by chance gets more and more difficult. We demonstrate how 
careful one should be with the interpretation of such combination frequencies by presenting the statistical distributions 
derived from artificial data sets. Based on comparisons to observations, we find that, despite the high statistical probability 
of finding combinations in random data sets (having similar properties to the ones of real stars), there is a significant differ- 
ence in the number of the lowest order combinations between stars with and starts without real combination frequencies. 
This way, a search for frequency combinations is very useful when interpreted properly, and when results are compared to 
simulations. 
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1 Introduction 

1.1 Frequency analysis of variable stars 

The light (and radial velocity) variations of pulsating stars - 
especially of heat-driven oscillators (for a full overview of 
pulsating stars and asteroseismology, see |Aerts et al.|2010| ) 
which pulsate in a regular, periodic pattern - can be eas- 
ily described with a model constructed from a set of sine 
functions, each of which having well defined parameters. 
Using an iterative prewhitening procedure to recover the in- 
dividual modes is a commonly used practice (for a detailed 
description of the method and discussion on different signif- 
icance criteria see, e.g.,|Degroote et aL|[2009; Papics et ah] 



2012). This process results in a list of amplitudes (Aj), fre- 
quencies (fj), and phases (6j), by which the light curve can 
be modelled via rif frequencies in the well-known form of 



F(td = c + V Aj sm[2n(fjti + Qj)l 



7=1 



1.2 Combination frequencies 

If a selected frequency fj from the complete set of observed 
frequencies can be written as 

nf 1 

fj = Yj Ci ^ ±A f = Cl f l + C 2/2 + • • • + C nf f nf ± -, 

i.e., as a linear combination of independent fi frequencies 
present in the power spectrum (allowing only a small Af 
deviation from the exact combination value according to the 



frequency resolution, with T being the timebase of the full 
dataset, and where the individual coefficients are c t e Z), 
then we term fj a combination frequency. We define 



O 



'fj 



2' 



to be the combination order, and using this notation, the 
combination order tells us if we are dealing with inde- 
pendent (Of j = 0), indistinguishable (Of. = 1), or com- 
bination (Of j > 2) frequencies. In the latter case, f are 
termed parent frequencies when the corresponding coeffi- 
cients Ci ± 0. Harmonics are a subset of combinations where 
only Ci=k±j > 2 while every other coefficient - 0. 

In recent years, space-based instruments like C0R0T 
( Auvergne etldT||2009l ) and Kepler { jGilliland et"aL][20T0l ) 
opened up the era of almost uninterrupted data with a preci- 
sion typically two orders of magnitude better than ground- 
based photometry, enabling us to study the power spectrum 
of pulsations without serious aliasing problems combined 
with a much lower noise level than before. This results in 
an at least an order of magnitude increase in the number of 
significant frequencies found in the power spectrum of pul- 
sating stars. This increase in peak density over regions of 
the periodogram gives rise to ambiguous identifications or 
misinterpretations of combinations and harmonics. 

2 Statistics of combination frequencies 

2.1 Creating artificial light curves 

It is very important to decide if combination frequencies are 
real or not to make a proper physical interpretation of the 
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detected signals. Does it make sense to talk about high or- 
der combinations in terms of physical phenomena, or, given 
a massive set of frequencies, will we always find them sim- 
ply by chance? To test this, we simulated thousands of light 
curves having a similar but random power spectrum as the 
main sequence pulsating B s tars in our Kepler Guest Ob- 
servers ample (Pap ics et al. j in preparation). For each ar- 
tificial light curve we generated data sets based on a ran- 
dom number of frequencies selected from a typical range of 
observed real-life frequencies. In addition, we have given 
these frequencies random amplitudes having a distribution 
which resembles the observed frequency distribution of our 
Kepler sample. Very realistic light curves emerged (exam- 
ples can be seen on Fig. [I]), randomly showing beating pat- 
terns with both short and long timescales, or features which 
- in case of real stars - we tend to interpret as signs of spot- 
ted stars or binarity, and sometimes even quasi-equidistant 
splittings or frequency series. 

2.2 Searching for combinations 

We used a conservative method to look for combination fre- 
quencies using the subset of prewhitened frequencies with 
the fifty highest amplitudes. We set the maximum number 
of combination terms C[fi to 3, while allowing |c/| e [0,3], 
to avoid extremely high - and physically unlikely - combi- 
nation orders. We assume that combination peaks will have 
lower amplitudes than their parents, so we only try linear 
combinations in the subset where / < j, and where previ- 
ously frequency was found to be independent (so we do 
not use combination peaks as parents). To be as conserva- 
tive as possible, we use an amplitude criterion requiring the 
relation 



O 



'fj 



to be true for peaks accepted as combination frequencies, 
which - in an empirical way - ensures that the A t ampli- 
tudes of the parent peaks are strong enough to create a com- 
bination with the given Aj amplitude. 

These tests enabled us to check what is the probability of 
finding harmonics with a given order O in a random dataset, 
just by chance, without forcing the appearance of harmonics 
during the generation of these simulated light curves. Look- 
ing at the distributions from a run of simulations containing 
one thousand artificial data sets in Fig.[2]we can see that, on 
average, only 17.3% of the 50 strongest peaks are identified 
as independent, 4.5% as O = 2 combinations, while 78.2% 
as higher order combinations. The distribution of indepen- 
dent frequencies and = 2 combinations are relatively nar- 
row, so significant deviations will stand out very clearly. 

2.3 Comparison to observations 

We used the same method outlined in Sect. l2.2l for the fre- 
quency spectra of the nine stars of our Guest Observer sam- 
ple (we analysed Q1-Q6 data for these tests) to see if the 
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Fig. 3 The same as the bottom right subplot in Fig. ||but 
now showing the distribution of combination frequencies 
with a given order when the appearance of O = 2 combi- 
nations is enforced during the simulations. 



observed number of given order combinations fall in the ex- 
pected distributions (see Fig. [2]). For most of the stars, and 
most of the combination orders this was the case. Signifi- 
cant deviations can only be seen in three cases and only at 
combinations having = 2. For these stars the observed 
number of O = 2 combinations is much higher than pre- 
dicted by the simulations, which means that there has to be 
a real, physical effect behind the occurrence of combination 
frequencies. Forcing the occurrence of O = 2 combinations 
in the artificial data sets, we create much better matching 
distributions for these stars (see Fig. [3]). 

3 Conclusions 

Seismic modelling of heat-driven oscillators can not be suc- 
cessful using hundreds of frequencies without mode identi- 
fication constraints, as raising the number of modes which 
are used in the fitting process quickly raises the complexity 
beyond the reach of our computing power, or simply beyond 
the precision of our present-day models. 

Filtering out combination frequencies when looking for 
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a set of independent frequencies can be difficult ( [Balona 
2011; Uytterhoev en et al. 2011| ), e.g., the presence of bi- 
nary components or rotation can give rise to very long se- 
ries of harmonics ( |Balona et aTJ|20Tfl |Clarke||j003] ), and 
one can also treat rotationally split multiplets as a kind of 
combinations if the rotation period is present in the power 
spectrum (Breg er et al.|2011 ). Most of these effects can be 
correctly interpreted from spectroscopy, or by looking for 
clear rotationally split multiplets in the power spectrum, so 
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Fig. 1 Example cutout sections of artificial light curves based on the sampling and noise properties of the long cadence 
data from the Kepler satellite and the pulsation properties - as frequency ranges and amplitude distribution - of main 
sequence B stars. The artificial light curves (with added random white noise) are plotted with black dots, while the red 
solid curves show the corresponding noise-free models constructed from the generated parameter sets. 



in ideal cases these should be already known when search- 
ing for combinations. In case of rotational or binary signa- 
ture, higher order harmonics are expected to appear, so these 
should be identified making sure that results from photom- 
etry and spectroscopy agree with each other. 

After this step it does not make sense to look for high or- 
der combinations, as our tests showed that those appear very 
easily from random sets even when a conservative search 
method is used. We strongly suggest to look at the observed 
number of O = 2 combinations and compare it to statis- 
tics of artificial sets created using frequency ranges and am- 
plitude distributions which represent the related observed 
quantities well. Only if there is a significant deviation be- 
tween the observed and predicted number of combinations, 
a clear signature of an underlying physical effect should be 
concluded, otherwise any combinations found in the data 
are most probably mere coincidence. Only when the pres- 
ence of O = 2 combinations is proven one should proceed 
and try to identify higher order combinations. Additional 
spectroscopy might be very useful to help deciphering the 
physical origin of combination frequencies. Ignoring such 
data can easily lead to meaningless identification and over- 



interpretation of high order multiplets in variable star re- 
search. 
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Fig. 2 The observed number of combination frequencies per order for our nine Kepler stars (shown in individual subplots 
with the corresponding KIC numbers, plotted with cyan coloured asterisk symbols) compared to the statistical distributions 
(plotted as log scale coloured 2D histograms in the background where colours denote the number (log N) of occurrences of 
having n combination frequencies with a given order (O) in the full set of artificial data sets, and where the average values 



per order are also plotted with white circles) from simulations described in Sect. 2.1 and Sect. 2.2 
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